Data Editing and Imputation in Business Surveys Using “R”

نویسنده

  • Elena ROMASCANU
چکیده

Purpose – Missing data are a recurring problem that can cause bias or lead to ineffi cient analyses. The objective of this paper is a direct comparison between the two statistical software features R and SPSS, in order to take full advantage of the existing automated methods for data editing process and imputation in business surveys (with a proper design of consistency rules) as a partial alternative to the manual editing

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Use of R in Business Surveys at the Italian National Institute of Statistics: Experiences and Perspectives

Over the last six years, R has been steadily gaining ground in Istat, since a strategic decision to limit dependence on proprietary technologies (like SAS) was taken. A migration activity of our critical IT tools from SAS to R was carried out (we can cite MAUSS-R for optimal sample allocation, and ReGenesees for the calculation of estimates and sampling errors), and new R packages were develope...

متن کامل

Multivariate Outlier Detection and Treatment in Business Surveys

Multivariate outlier detection based on the Mahalanobis distance with the BACON-EEM algorithm, the TRC algorithm and the ER algorithm is presented and imputation of outliers and further missing values is discussed. The methods are illustrated with a data set on Swedish municipalities. The relation between outliers, influential observations and selective editing is explored. Finally robust multi...

متن کامل

Imputation of parent-offspring trios and their effect on accuracy of genomic prediction using Bayesian method

The objective of this study was to evaluate the imputation accuracy of parent-offspring trios under different scenarios. By using simulated datasets, the performance Bayesian LASSO in genomic prediction was also examined. The genome consisted of 5 chromosomes and each chromosome was set as 1 Morgan length. The number of SNPs per chromosome was 10000. One hundred QTLs were randomly distributed a...

متن کامل

Microdata Imputations and Macrodata Implications: Evidence from the Ifo Business Survey

A widespread method for nowand forecasting economic macro level parameters such as GDP growth rates are survey-based indicators which contain early information in contrast to official data. But surveys are commonly affected by nonresponding units which can produce biases if these missing values can not be regarded as missing at random. As many papers examined the effect of nonresponse in indivi...

متن کامل

اهمیت خویشاوندی ژنتیکی و رکورد فنوتیپی بر صحت ژنومی داده‌های جانهی شبیه‌ سازی شده با استفاده از مدل های حیوانی در حضور اثرات متقابل ژنوتیپ و محیط

The objective of this study was to investigate the role of genetic relationships between training and validation set with considering different ratio of phenotypic records of training set on accuracy of genomic prediction via animal models containing genotype × environment interactions in simulated imputation data. For this purpose, four different scenarios using 15k density containing differen...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014